A Hybrid Approach for Data Mart Schema Design from NL-OLAP Requirements
نویسندگان
چکیده
OLAP systems remain difficult to develop for two primary reasons. The first reason is that the REQ analysis step is typically overlooked [1]. The second reason is that, even when attention is accorded to this step, REQ are described from a too technical perspective for decision makers to understand and validate [2]. However, the lack of user involvement in the specification can easily lead to the failure of the project. To ensure the involvement of decision makers in REQ specification, two solutions are offered: non technical notations for OLAP REQ expression (cf., tabular format [3]) and approaches to assist decision makers in identifying their REQ (cf., goal-driven approaches with either a graphical notation [1] or SQL scenarios [4], reuse of MultiDimensional (MD) patterns representing generic OLAP REQ [5]). Despite the assistance they offer, the proposed REQ notations and specification approaches presume that the decision makers are familiar with the MD concepts and their relationships. Within this context, our proposed solution (Fig.1) aims at facilitating the involvement of decision makers in the REQ specification step by offering a NL based notation for expressing the OLAP REQ, and a validation approach to ensure that the specified REQ are realizable. As illustrated in Fig.1, our approach relies on a template for OLAP REQ identification [6]. This template is composed of elements derived from the decision making process to identify the analyzed business process, the relevant actors, the indicators formulas, and the analytical queries used for decision making.
منابع مشابه
Computer-aided Data-mart Design
With decision support systems, decision-makers analyse data in data marts extracted from production bases. The data-mart schema design is generally performed by expert designers (administrator or computer specialist). With data-driven, requirement-driven or hybrid-driven approaches, this designer builds a data-mart defining facts (analysis subjects) and analysis axes. This process, based on dat...
متن کاملAn Improved Semantic Schema Matching Approach
Schema matching is a critical step in many applications, such as data warehouse loading, Online Analytical Process (OLAP), Data mining, semantic web [2] and schema integration. This task is defined for finding the semantic correspondences between elements of two schemas. Recently, schema matching has found considerable interest in both research and practice. In this paper, we present a new impr...
متن کاملTowards an Automatic Data Mart Design
The Data Warehouse design involves the definition of structures that enable an efficient access to information. The designer builds a multidimensional structure taking into account the users requirements. In fact, it is a highly complex engineering task that calls for a methodological support. This paper lays the grounds for an automatic, stepwise approach for the generation of data warehouse a...
متن کاملAdapting Multidimensional Schemes to Data sources using Algebraic Operators
Designing a decisional system requires a methodology different from those commonly adopted for operational information systems. In our methodology data marts are constructed on the basis of user requirements specified using OLAP design patterns. Since these patterns are independent of any data source, the data mart design process should solve the problems due to differences between user OLAP re...
متن کاملData Mart A Data Warehouse Data Mart B Data
EEcient query processing is a critical requirement for data warehousing systems as decision support applications often require minimum response times to answer complex, ad-hoc queries having aggrega-tions, multi-ways joins over vast repositories of data. This can be achieved by fragmenting warehouse data. The data fragmentation concept in the context of distributed databases aims to reduce quer...
متن کامل